The role of 'delta' features in speaker verification
نویسندگان
چکیده
Our previous experiments in Text-Dependent and -Independent Speaker Verification (TD-SV and TI-SV) using trajectory-based models, showed that non-stationary segments benefit TD-SV but not TI-SV, because in TI-SV maximum likelihood (ML) training results mainly in stationary segments. This result questions the role of non-stationary, ‘delta’ parameters in conventional GMM-based TI-SV. In this paper we develop and study a number of GMM-based TI-SV systems for Switchboard which use combinations of static and dynamic parameters. We show that in our segmental GMM and the AFRL GMM system, the trajectory slopes and deltas focus the verification process onto the stationary regions. In our GMM systems, however, the deltas are modelling some speech dynamics. The different functions of deltas may be due to different system settings and frontend processing (e.g. RASTA, speech noise detector). This indicates that the role of delta parameters in GMM-based speaker verification systems is more complex than simply “modelling dynamics”. Our results also show that the superior performance obtained with front-end parameterizations which combine static and delta parameters only emerges after RASTA filtering; without RASTA filtering a ‘delta-only’ front-end performs best.
منابع مشابه
Application of shifted delta cepstral features in speaker verification
Recently, Shifted Delta Cepstral (SDC) feature was reported to produce superior performance to the delta and delta-delta features in cepstral feature based language identification (LID) systems [1, 2]. This paper examines the application of SDC features in speaker verification and evaluates its robustness to channel mismatch, manner of speaking and session variability. The result of the experim...
متن کاملSelection of the best set of shifted delta cepstral features in speaker verification using mutual information
Shifted delta cepstral (SDC) features, obtained by concatenating delta cepstral features across multiples speech frames, were recently reported to produce superior performance to delta cepstral features in language and speaker recognition systems. In this paper, the use of SDC features in a speaker verification experiment is reported. Mutual information between SDC features and identity of a sp...
متن کاملSpeaker Verification with Shifted Delta Cepstral Features: Its Pseudo-Prosodic Behavior
This paper examines the linear relation between Shifted Delta Cepstral (SDC) features and the dynamic of prosodic features. SDC features were reported to produce superior performance to ∆ features in Language Identification and speaker recognition systems. A selection of more correlated SDC features is used in speaker verification to evaluate its robustness to channel/handset mismatch. The expe...
متن کاملNoise robust speaker verification with delta cepstrum normalization
This paper introduces a delta cepstrum normalization (DCN) technique for speaker verification under noisy conditions. Cepstral feature normalization techniques are widely used to mitigate spectral variations caused by various types of noise; however, little attention has been paid to normalizing delta features. A DCN technique that normalizes not only base features but also delta-features was r...
متن کاملLimited Data Speaker Verification: Fusion of Features
The present work demonstrates experimental evaluation of speaker verification for different speech feature extraction techniques with the constraints of limited data (less than 15 seconds). The state-of-the-art speaker verification techniques provide good performance for sufficient data (greater than 1 minutes). It is a challenging task to develop techniques which perform well for speaker verif...
متن کامل